Speech Enhancement using Temporal Masking and Fractional Bark Gammatone Filters
نویسندگان
چکیده
A speech enhancement technique based on the temporal masking properties of the human auditory system is presented. The noisy signal is divided into a number of sub-bands with fractional bark accuracy, and the sub-band signals are individually and adaptively weighted in the time domain according to a short-term temporal masking threshold to noise ratio estimate in each subband. Objective measures and informal listening tests demonstrate significant improvements over three well-known existing methods when tested with speech signals corrupted by various noises at signal to noise ratios of 0, 10, and 20 dB.
منابع مشابه
A Gammatone-based Psychoacoustical Modeling Approach for Speech and Audio Coding
We propose a new approach for modeling auditory masking based on gammatone filters for application areas including speech/audio coding and audio watermarking. Besides the use of gammatone filters, this model differs from existing audio coding psychoacoustical models (e.g., the ones used in MPEG), in taking into account the contribution of a range of filters in computing the distortion, rather t...
متن کاملSpeech Enhancement by Modified Convex Combination of Fractional Adaptive Filtering
This paper presents new adaptive filtering techniques used in speech enhancement system. Adaptive filtering schemes are subjected to different trade-offs regarding their steady-state misadjustment, speed of convergence, and tracking performance. Fractional Least-Mean-Square (FLMS) is a new adaptive algorithm which has better performance than the conventional LMS algorithm. Normalization of LMS ...
متن کاملA Generalized Time–Frequency Subtraction Method for Robust Speech Enhancement Based on Wavelet Filter Banks Modeling of Human Auditory System
We present a new speech enhancement scheme for a single-microphone system to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-tonoise ratio. A psychoacoustic model is incorporated into the generalized perceptual wavelet denoising method to reduce the residual noise and improve the intelligibility of speech. The proposed method is a generalized tim...
متن کاملWideband speech and audio coding using gammatone filter banks
Considerable research attention has been directed towards speech and audio coding algorithms capable of producing high quality coded speech and audio, however few of these use signal representations which account for temporal as well as spectral detail. This paper presents a new technique for 16 kHz wideband speech and audio coding, whereby analysis and synthesis are performed using a linear ph...
متن کاملWavelet Filter Bank Based Robust Speech Enhancement
WAVELET FILTER BANK BASED ROBUST SPEECH ENHANCEMENT L.M. Kadam, D.S. Aldar, and B.B. Godbole K.B.P. College of Engineering and Polytechnic, Satara E-mail: [email protected], [email protected], [email protected] The paper investigate new speech enhancement scheme to meet the demand for quality noise reduction algorithms capable of operating at a very low signal-to noise ratio....
متن کامل